A Modified Speech Enhancement Using Adaptive Gain Equalizer with Non linear Spectral Subtraction for Robust Speech Recognition
نویسندگان
چکیده
In this paper we present an enhanced noise reduction method for robust speech recognition using Adaptive Gain Equalizer with Non linear Spectral Subtraction. In Adaptive Gain Equalizer method (AGE), the input signal is divided into a number of subbands that are individually weighed in time domain, in accordance to the short time Signal-to-Noise Ratio (SNR) in each subband estimation at every time instant. Instead of focusing on suppression the noise on speech enhancement is focused. When analysis was done under various noise conditions for speech recognition, it was found that Adaptive Gain Equalizer method algorithm has an obvious failing point for a SNR of -5 dB, with inadequate levels of noise suppression for SNR less than this point. This work proposes the implementation of AGE when coupled with Non linear Spectral Subtraction (AGE-NSS) for robust speech recognition. The experimental result shows that out AGE-NSS performs the AGE when SNR drops below -5db level. Keywords—Adaptive Gain Equalizer, Non Linear Spectral Subtraction, Speech Enhancement, and Speech Recognition.
منابع مشابه
Improving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملSpeech Enhancement based on Linear Prediction Error Signals and Spectral Subtraction
Speech processing and recognition are key technologies to produce smart user interfaces in an increasing number of devices. Moreover, robust speech recognition is considered mandatory for a reliable operation of such elements in realistic working conditions. Through this paper, a method of processing speech degraded by noise and reverberation is proposed. This approach involves analyzing the pr...
متن کاملRobust automatic continuous-speech recognition based on a voiced-unvoiced decision
In this paper, the implementation of a robust front-end to be used for a large-vocabulary Continuous Speech Recognition (CSR) system based on a Voiced-Unvoiced (V-U) decision has been addressed. Our approach is based on the separation of the speech signal into voiced and unvoiced components. Consequently, speech enhancement can be achieved through processing of the voiced and the unvoiced compo...
متن کاملSpeech enhancement for a car environment using LP residual signal and spectral subtraction
Handsfree speaker input is mandatory to enable safe operation in cars. In those scenarios robust speech recognition emerges as one of the key technologies to produce voice control car devices. Through this paper, we propose a method of processing speech degraded by reverberation and noise in an automobile environment. This approach involves analyzing the linear prediction error signal to produc...
متن کاملRobust Speech Recognition Using Speech Enhancement
Automatic Speech Recognition (ASR) has matured into a technology which is becoming more common in our everyday lives, and is emerging as a necessity to minimise driver distraction when operating in-car systems such as navigation and infotainment. In “noise-free” environments, word recognition performance of these systems has been shown to approach 100%, however this performance degrades rapidly...
متن کامل